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Abstract 

This paper studies the distribution of the component spectrum of 
combinatorial structures such as uniform random forests, in which the 
classical generating function for the numbers of (irreducible) elements 
of the different sizes converges at the radius of convergence; here, 
this property is expressed in terms of the expectations of independent 
random variables Zj, j > 1, whose joint distribution, conditional on 
the event that YTj=\3^j = n -> gives the distribution of the component 
spectrum for a random structure of size n. For a large class of such 
structures, we show that the component spectrum is asymptotically 
composed of Zj components of small sizes j, j > 1, with the remaining 
part, of size close to n, being made up of a single, giant component. 

1 Introduction 



In this paper, we consider the distribution of the asymptotic component spec- 
trum of certain decomposable random combinatorial structures. A struc- 
ture of size n is composed of parts whose (integer) sizes sum to n; we let 
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:= (C[ n \ C^, . . . , C^) denote its component spectrum, the numbers 
of components of sizes 1,2, ... ,n, noting that we always have J2]=i jCf^ = n. 
For each given n, we assume that the probability distribution on the space 
of all such component spectra satisfies the Conditioning Relation: 

C(CW) = C ({Z,, Z 2 ,..., Z n ) I f^jZi = nj , (1.1) 

where Z := (Zj, j > 1) is a sequence of independent random variables, the 
same for all n; that is, for yi, y 2 , . . . , y n G Z + , 

^ n I n \ 

nm—^iwiE^j- (i-2) 

This apparently curious assumption is satisfied by an enormous number of 
classical combinatorial objects, such as, for instance, permutations of n ob- 
jects under the uniform distribution, decomposed into cycles as components, 
when the Zj are Poisson distributed, with Zj ~ Po (1/j); or forests of unla- 
beled unrooted trees under the uniform distribution, decomposed into tree 
components, when the Zj are negative binomially distributed: see Arratia, 
Barbour & Tavare (2002, Chapter 2)[ABT] for many more examples. How- 
ever, such structures also arise in other contexts. For instance, the state of 
a coagulation-fragmentation process evolving in a collection of n particles 
can be described by the numbers Cj™' ) of clusters of size j, 1 < j < n, and 
if such a process is reversible and Markov, then its equilibrium distribution 
satisfies the conditioning relation for some sequence Z of random variables. 
In particular, under mass action kinetics, it follows that Zj ~ Po (aj), where 
(aj, j > 1) are positive reals, determined by the coagulation and fragmenta- 
tion rates; see Whittle (1965), Kelly (1979, Chapter 8), Durrett, Granovsky 
& Gueron (1999) and Freiman & Granovsky (2002a). 

In order to describe the asymptotics as n — > oo, it is necessary first to 
say something about how the distributions of the Zj vary with j. Now the 
distribution given in (1.2) remains the same if the random variables Zj are 

(x) 

replaced by 'tilted' random variables Zj , where 

W[zf ] =i} = W[Zj = i}x Jl /kj{x), (1.3) 



2 



for any x > such that 

kj(x) := E<jV^} < oo. 

Specializing to the setting in which Zj ~ Po (dj) for each j, this means that 
exactly the same distributions are obtained for each n in (1.2) if dj is replaced 
by djxi for each j, for any fixed x > 0. Thus geometrically fast growth or 
decay of the Oj can be offset by choosing x~ x = lim^oo (should the limit 
exist), without changing the asymptotics. Hence, to find an interesting range 
of possibilities, we look at rates of growth or decay of EZj which (if necessary, 
after appropriate tilting) can be described by a power law: JEZj ~ Aj a as 
j — > oo, or, more generally, EZj regularly varying with exponent a G H. 

Three ranges of a can then broadly be distinguished. The most intensively 
studied is that where a = — 1, and within this the logarithmic class, in which 
lEZj ~ TP[Zj = 1] ~ 6j~\ for some 9 > 0: see the book [ABT] for a detailed 
discussion. For a > — 1, the expansive case, the asymptotics were explored 
for Poisson distributed Zj in Freiman & Granovsky (2002a,b), with the help 
of Khinchine's probabilistic method, and particular models have been studied 
by many authors. Here, we treat the convergent case, in which a < —1, in 
considerable generality. Our approach is quite different from the classical 
approach by way of generating functions, thereby allowing distributions other 
than the standard Poisson and negative binomial to be easily discussed. Note 
also that not all classical combinatorial structures fall into one of these three 
categories: random set partitions, studied using the Conditioning Relation 
by Pittel (1997), have Poisson distributed Zj with means x^ which are 
never regularly varying, whatever the choice of x > 0. 

As will be seen in what follows, a key element in the arguments is estab- 
lishing the asymptotics of the probabilities W\T bn (Z) = I] for / near n, where, 
iory := {y u y 2 ,...) E Z~ 

TUv) ■= E m, 0<b<n. (1A) 
j=b+l 

That this should be so is clear from (1.2), in which the normalizing constant 
is just the probability P[To n (Z) = n], and is the only element which can- 
not immediately be written down. In the context of reversible coagulation- 
fragmentation processes with mass-action kinetics, the partition function c n 
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investigated by Freiman & Granovsky (2002a) is given by 



c n : = exp < a i ( &[Ton(Z) = n], 




(1.5) 



explaining its relation to many of their quantities of interest. Now, in the 
expansive case, taking Poisson distributed Zj with means dj ~ Aj a , a > — 1, 
one has 



The Bernstein inequality then implies that, for large n, the probability 
P[To n (Z) = n] is extremely small, making a direct asymptotic argument 
very delicate. However, recall from (1.3) that the conditioning relation (1.1) 
delivers the same distribution for the combinatorial structure if the Pois- 
son distributed random variables Zj with means dj are replaced by Pois- 
son distributed random variables Z^ with means djx\ for any x > 0. 
Choosing x = x n in such a way that ]ET 0n (Z^) = n makes the probability 
JP[To n (Z^) = n] much larger, and a local limit theorem based on the normal 
approximation can then be used to determine its asymptotics. The resulting 
component spectra typically have almost all their weight in components of 
size about few smaller components making up the rest. 

For the logarithmic case, taking Poisson distributed Zj with means dj ~ 
9/j, 9 > 0, one has 



so that no tilting is required. However, since To n (Z) > 0, these asymptotics 
also imply that C(n~ 1 T Qn (Z)) is not close to a normal distribution — there 
is a different limiting distribution that has a density related to the Dickman 
function from number theory — and special techniques have to be developed 
in order to complete the analysis. Here, the component spectra typically 
have components of sizes around nP for all < f3 < 1. 

In the convergent case, taking Poisson distributed Zj with means dj ~ 
Aj a , a < —1, the sequence of random variables T 0n (Z) converges without 
normalization, and both the methods of proof and the typical spectra as 
n — * oo are again qualitatively different. We demonstrate that, for large n, 



JET 0n (Z) x n 2+a > n and SD (T Qn (Z)) x ri 



(3+a)/2 



< JET 0n (Z). 



ET 0n (Z) ~ n9 and SD (T 0n (Z)) x n, 
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the typical picture is that of small components whose numbers have the inde- 
pendent joint distribution of the Zj, the remaining weight being made up by 
a single component of size close to n. This remains true without the Poisson 
assumption, under fairly weak conditions; for instance, our theory applies to 
the example of uniform random forests, where the asymptotic distribution 
of the size of the largest component was derived using generating function 
methods by Mutafchiev (1998). Bell, Bender, Cameron and Richmond (2000, 
Theorem 2) have also used generating function methods to examine the con- 
vergent case for labelled and unlabelled structures, which, in our setting, 
correspond to Poisson and negative binomially distributed Z/s, respectively; 
we allow an even wider choice of distributions for the Zj. They use somewhat 
different conditions, and are primarily interested in whether or not the prob- 
ability that the largest component is of size n has a limit as n — > oo, though 
they also consider the limiting distribution of the number of components. 
Under our conditions, these limits always exist. 

2 Results 

We work in a context in which the random variables Zj may be quite general, 
provided that, for large j, their distributions are sufficiently close to Poisson. 
From now on, we use the notation aj := JEZj, and then write aj = j~' 3 ~ 1 A(j) 
for q = —a — 1 > in the convergent case, where the quantities X(j) are 
required to satisfy certain conditions given below. 

Since now a,j — > 0, being close to Poisson mainly involves assuming that 
1P[Zj > 2] <C 1P[Zj = 1] as j — > oo, so that the Zj can be thought of as 
independent random variables which usually take the value 0, and occasion- 
ally (but only a.s. finitely often) the value 1. This setting is broad enough to 
include a number of well known examples, including uniform random forests 
consisting of (un)labelled (un)rooted trees. In such circumstances, we are 
able to use a technique based on recurrence relations which are exactly true 
for Poisson distributed Zj, and which can be simply derived using Stein's 
method for the compound Poisson distribution (Barbour, Chen and Loh, 
1992). A corresponding approach is used in [ABT], though the detail of the 
argument here is very different. 

In describing the closeness of the distributions of the Zj to Poisson, we 
start by exploiting any divisibility that they may possess, supposing that 
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each Zj can be written in the form Zj = J2k=i Zjk for some Tj > 1, where, for 
each j, the non-negative integer valued random variables (Zjk, 1 < k < rj) 
are independent and identically distributed. Clearly, this is always possible if 
we take rj = 1. However, Poisson distributions are infinitely divisible (rj may 
be taken to be arbitrarily large), and the error bounds in our approximations 
become correspondingly smaller, if we are able to choose larger rj. Note, 
however, that negative binomially distributed Zj also have infinitely divisible 
distributions, so that closeness to Poisson is not a consequence of infinite 
divisibility alone. We now define (ej S , s,j > 1) by setting 



r,P[^ 1 = l] =: j-^\(j)(l- £jl 



r 3 W\Z 3X = s\ =: j " 'Ai./) s>2, (2.1) 



so that then 



< Eji = se js < 1, 

s>2 

because j _<?_1 A(j) = aj = JEZj = rjJEZj 1 . We then assume that 

< e js < e(jhs, s > 2, (2.2) 

where 

G := V" S7s < oo and lim e(j) = 0; (2.3) 

we write e*(j) := max/>j + i e{l) and r*(j) := min/ >: , r/. For the subsequent 
argument, we need to strengthen (2.3) by assuming in addition that 

G,:=£L a s 1+9 7a <oo, where L s := sup{A( [l/s\ )/A(Z)}. (2.4) 

s>2 1 > S 

We also need some conditions on the function A. We assume that 

A + (/) := max A(s) = o(/ /3 ) for any (3 > 0; (2.5) 

1<S<1 

L := sup max {X(l - t) / X(l)} < oo, (2.6) 

l>2 l/2<t<l 

and that 

lim{A(/-s)/A(/)} = 1 foralls>l; (2.7) 

l— too 
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note that, if A is slowly varying at infinity, then conditions (2.5)-(2.7) are 
automatically satisfied, and that L s defined in (2.4) is finite. We then write 
:= max/>! Z _/3 A(Z) for f3 > 0, and we also observe that 

TP[Zjk > 1 for oo many j, k] — and hence that T QoD (Z) < oo a.s., (2.8) 

from (2.1), (2.5) and the Borel-Cantelli lemma. Finally, we assume that the 
distributions of the random variables Zji of (2.1) are such that 

Po := minP[Z,-i = 0] > 0. (2.9) 

This restriction can actually be dispensed with — see Remark 3.2 — but it 
makes the proofs somewhat simpler. 

We are now in a position to state our first theorem, in which the asymp- 
totics of the probabilities P[T bn (Z) = I] are described. 

Theorem 2.1 Suppose that conditions (2.2) - (2.7) are satisfied for some 
q > and that (2.9) holds. For 1 < I < n, define 

HJl) := max {X' 1 (ni^PlT^ = 1} - 11. 

0<fc<Z-l 

Then H{1) := sup n> ;if n (/) satisfies lim^ 00 if(/) = 0. 

Note that the condition G q < oo of (2.4) is really needed here: see Re- 
mark 3.4. 

As is strongly suggested by the formula (1.2), Theorem 2.1, in giving 
the asymptotics of P[To„(Z) = n], can directly be applied to establish the 
asymptotic joint distribution of the entire component spectrum. This is given 
in the following theorem. For probability distributions on a discrete set X, 
we define the total variation distance dxv by 

d TV (P,Q) := sup \P(A)-Q(A)\. 

ACX 

Theorem 2.2 Suppose that conditions (2.2) - (2.7) are satisfied for some 
q > 0, and that (2.9) holds. Then 

lim d T v(£(C (n) ),Q n ) ^0, 

n—>oo 

where Q n is the distribution of (Zi, Z 2 , . . . , Z n ) + e(n — T 0n (Z)), and e(j) 
denotes the j 'th unit n-vector if j > 1, and the zero n-vector otherwise. 
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Theorem 2.2 has a number of immediate consequences, which all follow 
directly because T QoD (Z) < oo a.s. 

Corollary 2.3 

(a) For any fixed k > 1, 

C(C[ n \... : C^)^jC(Z 1: ... : Z k ) as n^oo. 

(b) IfY n := max{j : > 0} and K n := min{j : > 0} are the sizes of 
the maximal and minimal components of the spectrum, then, as n — > oo ; 

C(n - Y n ) - C(T 0oo (Z)) 

and 

p[i n >6Hl]p[z 3 = o] 

for any b > 1. In particular, it follows that 

UmF[y n = iir n = n] = IjF[Zj = 0]. (2.10) 

(^cj TTie asymptotic distribution of the number of components X n of the spec- 
trum is given by 

Remark 2.4 The assertion (a) of the above corollary states the asymptotic 
independence of the numbers of components of small sizes, a fact that has 
also been established in [ABT] in the logarithmic case, and also in the Poisson 
setting for q > in Freiman and Granovsky (2002b). This fact can be viewed 
as a particular manifestation of the heuristic general principle of asymptotic 
independence of particles in models of statistical physics. 

Assertion (b) says that, as n — > oo, the structures considered exhibit the 
gelation phenomenon; the formation, with positive probability, of a compo- 
nent with size comparable to n (see, for example, Whittle (1986), Ch. 13). 
Gelation also occurs in the logarithmic case [ABT], while it is not seen for 
q > in the setting of Freiman and Granovsky (2002b). In this sense, q = 
(a = —1) represents a critical value of the exponent. 
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Now JP[Y n = n] is the probability that a structure is 'connected', as, for 
instance, in Bell, Bender, Cameron and Richmond (2000), who give a very 
general discussion of circumstances in which p : = lim^c*, IP^ = n] exists, 
as well as giving a formula for the asymptotic distribution of X n . They work 
in the settings of either labelled or unlabelled structures; in our terms, they 
assume that the Zj have either Poisson or negative binomial distributions, 
respectively. Theorem 2.2 implies that p always exists under our conditions, 
and gives its value. 

Example. We apply our results to some classical models of random forests, 
referring for a discussion of the literature to the books of Pavlov (2000) 
and Kolchin (1998); see also Mutafchiev (1998, pp. 212-213). We begin by 
considering the uniform distribution over all forests of unlabelled, unrooted 
trees. The number rrij of such trees of size j was studied by Otter (1948), 
who showed that rrij ~ cp^j -5 / 2 , where p < 1, and gave values for both p 
and c. This combinatorial structure satisfies the conditioning relation with 
negative binomial random variables Zj ~ NB {rrij, pi), so that 



w[z j = s] = (i- / py 



rrij + s 



s 




It thus follows that EZj = rrijfp / (1 — pP) ~ cj -5 / 2 , implying that our results 
can be applied with \{j) — > c and q = 3/2. Note that, if we take r-j = 1 for 
all j, we have 

W[Z 3 = 2] = (1 - fPT> ( mj 2 + x {rrijfP) 2 , 

so that €j2 x j~ 5 / 2 as j — > oo. On the other hand, negative binomial distri- 
butions are infinitely divisible, and other choices of Tj in (2.1) are possible: 
for each j, we can take Zjk ~ NB (rrij/rj, pi), 1 < k < Tj, for any choice 
of r-j. The corresponding values of Ej S , s > 2, are then given, using (2.1), by 

r,P[Z, 1 = S ] = rj(l-pJ)^ ^ ^ jffi 

mj (I2i + s-l)- + 1)2* 

= pTp"— —r-^ 

s\ 

= {mjp'/i 1 -ft)}tj8, 
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from which, for fixed j and s > 2, we deduce the limiting value 

of ej S as Tj — > oo. Note that, as j — > oo, £* 2 ~ 2~ 1 p J is of very much 
smaller order than the order j -5 / 2 obtained for when taking rj = 1. As 
a result, many of the contributions to the bound H(l) of Theorem 2.1 for 
the relative error in approximating IP[T5 n = I] are reduced. These include 
the terms arising from r) f , r\\ and r] 2 , which enter in (3.15) and (3.16) below; 
furthermore, as observed in Remark 3.3, letting rj — * oo also allows us to 
take po — 1 and rjk(l) = 0, 3 < k < 6. 

Similar arguments can be used for forests of unlabelled, rooted trees, now 
with rrij ~ c'p _j, j~ 3 / 2 . For forests of labelled, (un)rooted trees, £(T 0oo ) is the 
compound Poisson distribution of J2j>ijZj, where 

z ^ Fo {w) (unrooted); z > ^ |,() (777 

The asymptotics of C(n — Y n ) then implied by Corollary 2.3 do not appear 
to agree with those of Mutafchiev (1998). 



(rooted). 



3 Proofs 

3.1 The perturbed Stein recursion and the basic lemma 

Stein's method for the Poisson distribution Po (a) is based on the Stein-Chen 
identity 

m{zf(z)} = amf(z + i), 

true for all bounded functions / : Z + — > 1R when Z ~ Po (a) ; this can be 
checked by writing the expectations on each side of the equation as sums, 
and then examining the coefficients of /(/) for each / > 0. In particular, it 
then follows that 

JEijZjgijZj)} = .,a ,¥.<,{.,/, ■ ./) 

if Zj ~ Po (dj), by putting f(l) = g(jl). Hence, for the compound Poisson 
distributed weighted sum 

T b * n := TUZ) = E jZ jt 
j=b+i 
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when Zj ~ Po (dj) and the are all independent, we deduce the Stein 
identity 

n 

e{t^(t 6 ;)}= ]T j % iE^r 6 ;+i), (3.1) 
j=b+i 

true for all bounded functions <? : Z + — > 1R and for any < 6 < n. Taking 
p = for any Z > b + 1, it thus follows that 

raw = fl = E r 9 A(i)p[r 6 ; = z - j\ 
j=b+i 

lAn 

= £ r 9 A(j)IP[T 6 ; = Z - j], Z > 6 + 1; (3.2) 
j=b+l 

note that this recursion can also be deduced directly by differentiating the 
compound Poisson generating function, and equating coefficients. Recur- 
sion (3.2), coupled with the fact that IP[T b * n = I] = for 1 < I < b, suc- 
cessively expresses the probabilities IP[T b * n = I] in terms of the probability 
P[T 6 *„ = 0]. In particular, if I < n is large and if {r q \(j)}/{l' q \(l)} is close 
to 1 when j is close to /, it suggests that 

IV[TL = I] « l- q mnTL < l - b - 1] « Z-«A(Z), 

giving the large / asymptotics for JP[T fe * n = I]. Our approach consists of 
turning this heuristic into a precise argument, which can be applied also 
when the Zj do not have Poisson distributions. 

Observing that the Stein identity (3.1) is deduced from the Stein-Chen 
identity 

nzMTL)} = r^KmigPL + j)}, b + 1 < j < n, (3.3) 

when Zj ~ Po (j _1_9 A(j)), our first requirement is to establish an analogue 
of (3.3) for more general random variables Zj. To do so, as in the previous 
section, we suppose that each Zj can be written in the form Zj = Y%=i Zjk 
for some Tj > 1, where, for each j, the non-negative integer valued random 
variables (Zj^, 1 < k < Tj) are independent and identically distributed. 
Then, writing T bn := T bn (Z), it is immediate that 

E{^(T 6n )} = 5>P[S;i = s]Eg{T b ^ + js), 

S>1 
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where := T bn — jZj 1 , so that, with the above definitions, 

JE{T bn g(T bn )} = jh JE{jZ j9 (T bn )} (3.4) 
j=b+i 

n 

= ]T Jn E *v[Zn = ^g(T^ ] + js) 

j=b+l s>l 
n 

j=b+i 

n 

+ E r q Kjm-en)mT^+j)-mT bn +j)} 
j=b+i 

n 

+ E Hr q mse 3 MgiTS+js). (3.5) 

j=b+l s>2 

Taking g = 1^ as before then gives the recursion 

l/\n 

iw[r bn = i] = ]r r q mnT bn = 1-3] 
j=b+i 

+ E r g A(j){(l-e,i)]P[T£ ) = /-j]-P[T 6n = /- J ]} 
j=b+i 

L(V2)Anj 

+ E Er 9 A0>i.F[r£ ) = z-j a ], (s.e) 

j=b+l s>2 

which can be understood as a perturbed form of the recursion (3.2). 

In order to show that the perturbation is indeed small, it is first necessary 
to derive bounds for the probabilities !P[T{, ra = s] and IPp - ^ = s]. However, 
since P[T 6 „ = s] > P[Zji = 0] IP = s], we have the immediate bound 

F[Tl£ = s]< Po 1 nT bn = s], s = 0,l,..., (3.7) 

where p > is as in (2.9). Hence the following lemma is all that is required. 

Lemma 3.1 Suppose that conditions (2.2) - (2.7) are satisfied for some 
q > 0, and that (2.9) holds. Then there exists a constant K > 0, depending 
only on the distributions of the Zj, such that 

P[T 6n = l}< K\{l)l~ 1 - q ) I > 1. 
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Proof. For 1 < I < b, the statement is trivial. For larger /, we proceed 
by induction, using the recursion (3.6), in which, on the right hand side, 
probabilities of the form IP[T fen = s] appear only for s < I, so that we may 
suppose that then P[T 6n = s] < K\(s)s^ 1 ^ q for all 1 < s < I. Under this 
hypothesis, we split the right hand side of (3.6) into three terms, which we 
bound separately; we take the first two lines together, and then split the 
third according to the value taken by js. 

For the first term, we use (3.7), the induction hypothesis and conditions 
(2.5) and (2.6) to give 

/An 

E nXU){l-e jl )P[T^=l-j] 
j=b+l 

m\ i 

< Er 9 A(j> 1 P[T feri = /-j]+ ]T j- q \(j)to 1 V[T bn = l-j] 

J=l J=L'/2J+1 

[1/2} 

< P - Q i \ + {yi/2\)KL\m/i) l+q r q +v, l L\m/i) q 

= irA(/)/-%(/)+p 1 LA(/)(2//) 9 , (3.8) 
where 

LI/2J 

77o(0 :=p 1 2 1+ "A + (L//2j)L/- 1 £ r 9 = as / ^ oo. 

j'=i 

For the second term, arguing much as before, we have 

L(J/2)AnJ 

E E lo^L^jy-^oo^.p^ = j - j S ] 

j=b+l s>2 
LI/2J 

< E E 1 {^<LV2J}r 9 A(j>5^ 1 KLA(/)(2//) 1+ " 

LI/2J 

< A(/)/-^ 1 2 1+ ^A + (L//2j)™- 1 J] j-%(j)G 

< e*(0)GK\(l)r q r] (l). (3.9) 
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For the third and final term, we have 

L(V2)AnJ 

E E M\im< 3S <i } 3~ q K3)se 3 snTtt = I ~ js] 

j=b+l s>2 

I [l/s\-l I 

< E E r q mse js nT^^l-js] + j:[l/srX([l/s\)se ll/si , s 

s=2 j=|_Z/2sJ+l s=2 

= S 1 + S 2 , (3.10) 
say. Now 

i LV»J-i 

Si < E E r^oo^OOT.Po^Aa-js)^-^)- 1 ^ 

s=2j=L«/2sJ+l 

/ 

< P v 1 KY,(l/Zs)- q L s L\(l)s ls s\\l/2s\)R q s- 1 - q /\ (3.11) 

s=2 

where i? ? := A q / 2 J2t>i t~ 1 ~ q ^ 2 , and this implies that 

Si < KX(l)r q r]i(l), (3.12) 

where 

:= p^LR q 2 q mm \e*([l/2t\)i2s q/2 L sls + e*((}) £ W 2 L S% 

- " [ s=2 s>t+l 

= o(l) as / — > oo, 

in view of (2.3) and (2.4). For S 2 , we have 

i 



S2 < J2^/^r q M[i/sM[i/s\)s ls 

s=2 

< \(l)r«J2s 1+ «L sls e(ll/s\) 

s>2 

■ = A(0r%(0, (3.13) 

where 772(0 = o(l) as / — > 00, again in view of (2.3) and (2.4). 
Collecting these bounds, we can apply (3.6) to show that 

nP[T b „ = Z] < X(l)r^L P ^ 1 + r, 2 (l) + K[r ]o (l)(l + e%0)G)+r] 1 (l)}} ] (3.14) 
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and this in turn is less than KX(l)l q provided that 



K{1 - [ito(J)(l + e*(0)G) +vi(l)}} > 2 q Lp^ + r) 2 (l), 

which can be achieved uniformly for all I > Iq, for some large Iq, by choosing 
K > 2 q+1 Lp \ As observed before, P[T 6n = 1} = for 1 < I < b. For 
b + 1 < / < Z , we can suppose that JP[T bn = t] < K l _ 1 \{t)t- 1 - q for all 
t < I - 1, and deduce from (3.14) that JP[T bn = t] < K l \{t)t- 1 - q for all t < /, 
if we take 

K x = m a x{K l . u 2 q L Po 1 + m (l) + ^_ 1 [t 7o (Z)(1 +e*(0)G) + m(l)]Y, 
this then completes the proof. o 

Lemma 3.1, together with the bounds derived in the course of its proof, 
are enough to enable us to exploit the recursion (3.6), and thereby to prove 
Theorems 2.1 and 2.2; the detailed argument is given in the next two sections. 

3.2 Proof of Theorem 2.1 

We exploit the recursion (3.6), observing first that the contribution from its 
last line was bounded in the proof of Lemma 3.1 by 

X(l)r q {e*(0)GK VQ (l) + K Vl (l) +r] 2 (l)}, (3.15) 

uniformly in < b < I — 1 . We now need to examine the second line in more 
detail. First, note that, by Lemma 3.1, for / < n, 

/An 

E r q me^\T b ^ = i-j] 
j=b+i 

[J/2J 

< ]T j- q X(j)Ge(j)p^(2/l) 1+q KLX(l) + Po 1 Ge*([l/2\)L\m/l) q 

< \(l)n(KGe*(0) m (l)+r,' o (l)), (3.16) 
where 

r]' (l) := 2 q p Q 1 GLe*{\l/2\) = o(l) as / -> oo. 
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The remaining part of the second line of (3.6) is then bounded by 

lAn 

E r q mmTii ] = 1-3]- nn n = i-m 
j=b+i 

- e j-'mmrfU = i-j]-e = *]pp£> = ; - ^ + 1)]} 

j=b+l s>0 

< Er 9 A( J ){p[^i>i]]p[T£ ) = /-j] 

+ E p [^i = s]p[t{£ = * - K* + 1)]}- (3-17) 

S>1 

We now observe, using Lemma 3.1, (3.7), (2.1) and (2.6), that 

ErAOWiiMi^^i-ji 

L'/2J 

< E r- 1 r 1 - 29 A 2 (i) Po - 1 XLA(0(2/0 1+9 
j'=i 

+ {r*(//2)}-Vo 1 {^(/)} 2 (2//) 1+ ^ 
:= A(Z)Z"%(Z), (3.18) 

where clearly ^(Z) = o(l) as Z — > 00. Then we also have 
Er ff A(j)F[^-i = l]F[T£ ) = Z-2j] 

< E r- 1 r 1 - 29 A 2 (i)p - 1 XLA(Z)(2/Z) 1 ^ 
j'=i 

+ {r*(Z/4)}-Vo 1 {^ 2 A(Z)} 2 (4/Z) 1+2 " 
:= X(l)r q m(l), (3.19) 

again by Lemma 3.1, where also ^(Z) = o(l) as Z — > 00. The remaining piece 
of the last term in (3.17) is split into two, as in the proof of the previous 
lemma, though the argument is a little simpler. The bound 

E r q m E Wi)<L*/2j}m-i = sippff = 1 - j(s + 1)] 

3=1 s>2 
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< p^KLXm/iy+^r-'j-^X^GeU) 

3=1 

:= A(Z)r%(Z), (3.20) 

with 775 (Z) = o(l) as / — > 00, follows immediately. For the second part, we 
have 

D'" ? A(j) £ l{LV2j<^+i)<0 P [^i = 'l^PiS? = ' " j( s + !)1 

J'=l s>2 

1-1 [i/(»+i)J 

< Pb'E E r7 1 A 2 0-)r 1 - 2, e(j)7JP[T ta = Z-j( a + l)] 

s=2j=L//2(s+l)J+l 

< Pb 1 5>*('/2(* + l))}"V(Li/2( a + 1)J)L 2 L S A(Z)A ?/2 {2( S + l)//} 1+3 ^ 2 7s 

s=2 

< {r*(0)}' 1 5*(0K 1 3 1+3 ^ £ L 2 L s A(/)A g/2 ( S //) 1+5 7s 

< A(Z)Z"%(Z), (3.21) 
with 

77 6 (Z) := {r*(0)}- 1 e*(0K 1 3 1+3g / 2 L 2 A (?/2 G' g Z- 1 = o(l) as Z ^ 00. 

Combining the results from (3.15) - (3.21), it follows from (3.6) that, for 
I < n, 

lJP[T hn = Z] = E r 9 A(i)P[T 6n = Z - j] + X(l)r%{l), 
j=b+i 

where r] 7 (l) = o(l) as Z — > 00. Hence we deduce that 

\- 1 (l)l 1+ «r[T bn = l] (3.22) 

= P[T ta <!-»-!] + g' {i!^ - l} PP1. = .] + 
In view of (2.7), we can find a sequence si — > 00 such that = o(Z) and 



max 

1<S<S( 



A(Z - g) 
A(Z) ' 



o(l) as Z — > 00 : 
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hence also 

E 

s=0 



Z*A(Z - s) 



- 1 



(l-s)«\(l) 
It then follows from (2.6) and (2.8) that 
l q X(l - s 



P[T 6n = s] = T) S (l) = o(l) as / ^ oo. 



L'/2J 

E 

S = S; + 1 



lP[T 6n = s] < (2 9 L + l)P[T fen > s z ] 



(Z - s) q X(l) 

< (2«L + l)P[T 0oo > Sj ] = 77 9 (Z) = o(l) as Z ^ oo. 



For the remaining sum, we use Lemma 3.1 to give 

Z"A(Z - s) 



l-b-l 

E 

s=U/2j+l 



(l-s)«\(l) 



bn 



1 LV2J i,W \ 

<KL\milf +q {^Y. 



L»/2J 



< KL2 q \ q/2 y- q/2 + (2/Z) « _9/2 
= ^io (0 — as Z ^ oo. 

Putting these estimates into (3.22), it follows that, for 1 < Z < n, 
\-\l)l 1+q JP[T b n = I] = 1 - > I - b - 1] + 77n(Z), 

where 7711 (Z) = o(l) as Z — > 00. Finally, since also, for b < |_Z / 2 J , 
P[T bn > l-b-1] < P[T 0oo > 1/2] ^ as Z -> 00, 
whereas, for [Z/2J < b < Z, 

P[T 6n >Z-6-l] < P[T 6oo > 0] < F[T L , /2J)00 > 0] 

00 

< E A(j)r 1_9 ^0 asZ^oo, 
i=L'/2J 



(3.23) 



(3.24) 



(3.25) 
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it follows from (3.24) that, for all n > I and < b < I — 1, we have 

\\-\l)l 1+c >P[T bn = I] - 1\ < H(l), 
where lim^oo H{1) = 0, as required. o 

Remark 3.2 The assumption (2.9), that p > 0, can be dispensed with, 
whatever the distributions of the Zj, provided that (2.3) holds. Clearly, for 
some m > 1 and ti, . . . , t m , we have 

p' n := min <^ min JPlZ^ = 01, min PfZ,-i = tA > > 0, 

[j>m+l J l<j<m J J ) 

since lim^oo EZj = 0. Then, for j < m and s > t,-, we have the simple 
bound 

P[T 6 ( f = I - js] < P[T 6n = I - j(s - ^)]/P[^-i = t,], 

which can be used as before, together with the induction hypothesis, to bound 
the right hand side of (3.6) in the proof of Lemma 3.1, provided that s > tj. 
So, recalling (3.4) with g = lyy, we write 

m 

j2njZji { i } (T bn )} 

3=1 

m 

= T,j r MZ j iI[Z j <t j ]l {l} (T bn )} 

3=1 

m 

+ J2jr j nZ j iI[Z j >t j ]l {l} (T bn )}. 

3=1 

The second term is estimated exactly as before. The first is no larger than 
KW\T bn = I], where 

m 

k := J~]jrjtj, 

3 = 1 

and hence can be taken onto the left hand side of (3.14) whenever / > 2k; 
with these modifications, the proof of Lemma 3.1 can be carried through as 
before. The proof of Theorem 2.1 requires almost no modification, if p is 
replaced by p' . 
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Remark 3.3 If the (Zj, j > 1) are infinitely divisible, then we can choose 
the rj to be arbitrarily large for each fixed j, in the limit making r]k(l) = 0, 
3 < k < 6, and p — 1. The limiting values as Tj — > oo of for fixed j 
and s > 1, are not however in general zero. 

Remark 3.4 The assumption (2.4) that G q be finite is not just an artefact of 
the proofs. It appears in particular when bounding the quantity S2 in (3.13) 
in the proof of Lemma 3.1, and is an element in the quantity 772(0 , which 
contributes to the bound on H(l) in Theorem 2.1. However, Z" 1 ^ is of 
the same order as the probability that T 0n is composed of s components of 
equal sizes [l/s\, plus a small remainder, for some s > 2, and G q < 00 is 
the condition which ensures that this probability is of smaller order than 
A(Z)/- 1 ^. 



3.3 Proof of Theorem 2.2 



As in [ABT, Lemma 3.1], it follows from the Conditioning Relation that, for 
any b < n, 



<hv{£(Ci n \...,Cjr'), £&,..., Z b )) 

tTtp\t ,1/1 Ppfcn = n-j] 



(3.26) 



Pick b = b{n) with n — b{n) — > 00, and observe that the right hand side 
of (3.26) is at most 

P[T 06 >i„]+IE^(To 6 ), 
where g n (j) = for j > j n and where, for all n such that H{n) < 1/2, 



< 9n(j) < 



n 1+q \(n-j) 



(n-j) 1+ "A(n) 



- 1 



+ 2 1+ iL2(H(n) + H(n-j)), < j < j n , 



from Theorem 2.1, provided that < j n < [n/2\ and that j n < n — b(n) — 1. 
This implies in particular that g n (j) is uniformly bounded for sequences j n 
satisfying these conditions. Now, from (2.7) and Theorem 2.1, it follows that 
lim n ^oo g n (j) = for each fixed j. Since also To& < Tooo a.s. and Tooo is 
a.s. finite, it follows by dominated convergence that lim^oo E<7 n (T 6( n )) = 0, 
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provided that, in the definition of g n , j n < min{n — b(n) — 1, [n/2\ }. On the 
other hand, 

n^n) > jn] < IP[Tooo > jn] ~> 0, 

so long as j n — > oo. Thus, taking for example 6(n) = |_3r7./ 4J and j n = 
[n/A\ — 1, it follows that 

d T v{£{C{ n \ C^),C(Zi, . . . , Z 6(n ))) — > 

as n — > oo. On the other hand, we have Z)j=L3n/4j+i Ci"" 1 — 1 a - s -> because 
2~'on(C <n ' ) ) = ^ a.s., by the definition of C^ n \ Hence, with b(n) as above, we 
have Cf ] = a.s. for all j > b(n) if T 06(n) (C (n) ) = n, while if T ob{n) (C (n) ) = t 

for some t < n — b(n), then C^l t = 1 and C| n ^ = for all other j > b(n). 
This proves the theorem. o 
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